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DETAILED ACTION 



1. This action is responsive to the Applicant's response filed 1/12/2004. 

As indicated in Applicant's response, claims 1, 22, 43 and 44 have been amended. 
Claims 1-44 are pending in the office action. 



2. The following is a quotation of 35 U.S. C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

3. Claims 1-9, 19-30, and 41-44 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Tseng et al., USPN: 6,009,256 (hereinafter Tseng), in view of Schlansker et al, USPN 
6,408,428 ( hereinafter Schlansker), and further in view of Kolchinsky et al., USPN: 5,535,406 
(hereinafter Kolchinsky). 

As per claim 1, Tseng discloses a method of creating run time executable code, 
comprising: 

partitioning a processing element (PE) array into a plurality of hardware accelerators (e.g. 
col. 1, lines 45-59; FPGA chip, col. 10, lines 2-22; col. 12, lines 45-55 - Note: array of chips on 
to be reconfigured into circuits is equivalent to array of PEs being partitioned in plurality of 
hardware accelerators) 

decomposing a program source code ( e.g. col. 58, line 62 to col. 59, line 50; Fig. 4, 26, 
28; Verilog, VHDL, parsing process, compiler 210 - col. 15, line 62 to col. 16, 38 - Note: 
language that is programmed from human readable format then parsed and compiled into 



Claim Rejections - 35 USC § 103 
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machine readable constructs are source program code, e.g. Verilog and VHDL) into a plurality of 
kernel sections (e.g. code component -col. 13, lines 27-36; HDL code description, component- 
col 1 1 , lines 47- 64; RTL level . . . signals - col. 60, lines 53-62; Fig. 29 - Note: HDL code 
component analysis is equivalent to decomposing into kernel sections; steps 301, 302, 304, 310, 
Fig. 4) 

(i) mapping said plurality of kernel sections into a plurality of hardware dependent model 
entities representation language (e.g. hardware model, gate level, - col. 11, lines 47- 64; RTL - 
col. 11, line 50 to col. 12, line 6; hardware execution models - col. 13, lines 36-40 ); and 

generating a mapping between said hardware accelerators and said hardware dependent 
model entities configured to support run time execution (e.g. steps 306, 307, 309 - Fig. 4 ; Fig. 6; 
col. 19, lines 14-21; col. 21, lines 36-43 ) of the plurality of kernel sections (e.g. Fig. 25-30). 

But Tseng does not explicitly disclose that the mapping in (i) is mapping of kernel 
sections into a plurality of hardware dependent executable code for execution on a plurality of 
hardware accelerators and that said hardware dependent executable code is to support runtime 
execution of the kernel sections. However, Tseng discloses a programming language derived 
from the hardware dependent model entities, e.g. gate level, for support of execution or 
simulation/debug on said plurality of hardware accelerators (e.g. Fig. 25-30; col. 11, line 50 to 
col. 12, line 6 - Note: RTL level signal used in program to effect signaling of a FPGA and 
support emulation/simulation/debug of hardware circuit is equivalent to mapped kernel sections 
code for executing hardware accelerators in FPGA). Mapping of design model specification 
language into corresponding executable code was a known concept in the art of 
software/hardware emulation/simulation and design at the time of the invention. Likewise, 
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Schlansker, in a method to find an optimum design for match hardware parameters and 
performance requirements to a particular type of processors, i.e. to translate into processor 
instruction code the corresponding parameterization specifics representing a hardware dependent 
requirements analogous to the mapping of kernels to hardware RTL language by Tseng, 
discloses mapping of instruction code sets via intermediate code against processor/hardware 
parameters or kernel sections as claimed (e.g. parametization, opsets, opgroups - col. 9, line 24 
to col. 10, line 32; AIR - col 19, line 23 to col. 21, line 4; Fig. 12-15). It would have been 
obvious for one of ordinary skill in the art at the time the invention was made to enhance the 
intermediate gate level program code as taught by Tseng with the mapping of such code into 
executable instruction groups as suggested by Schlansker, in case Tseng does not already include 
one such mapping, because this would help create a corresponding set of executable that best 
suits the hardware specification of the accelerators without extraneous use of resources, averting 
thereby hardware architecture/code operation non-compliance as intended by Schlansker. 

Nor does Tseng specify forming a matrix describing said hardware accelerators and said 
hardware dependent designs configured to support run time execution. Tseng, however, 
discloses mapping of hardware models or logic component to reconfigurable boards or circuit 
(e.g. Fig. 6; col. 21, line 63 to col. 22, line 8; step 309 - Fig. 4) and optimizing functions in 
interconnecting processing elements using a matrix (e.g. col. 25, line 49 to col. 26, line 15). 
Likewise, Schlansker teaches corresponding of instruction sets to a particular processor ( Fig. 1- 
3, 14) and use of template and matrices to map instruction groups or templates against execution 
constraints/parameters or data flow conditions ( Fig. 17-20 - Note: execution conditions implies 
machine specific execution, hardware dependency). Further, Kolchinsky, in a method for 
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processing applications using programmable logic device, e.g. FPGA, in executing a 
multitasking kernel with logic design configuration and encoded hardware algorithm coupling 
analogous to the hardware/software modeling and simulation process by Tseng, discloses using a 
programmable logic matrix array for mapping execution operations to criteria of the hardware 
control and configuration (e.g. col. 5, lines 35-65) to perform a specific algorithm. In view of 
Tseng's suggesting of the use of connectivity matrix and mapping of software logic or models to 
processing elements or circuits, it would have been obvious for one of ordinary skill in the art at 
the time the invention was made to use the matrix array as suggested by Schlansker and taught 
by Kolchinsky and apply it to the descriptively mapping of hardware modeled from software 
components (designs as claimed) to the reconfigurable target circuits (accelerators as claimed) as 
taught by Tseng to support execution of kernel sections. One of ordinary skill would be 
motivated to do so because this would provide more visibility to the mapping process and added 
flexibility, hence cost-efficiency, for modification or correction in general and specially when the 
target architecture increases in complexity (e.g. Kolchinsky: col. 1, line 61 to col. 2, line 4). 

As per claims 2 and 3, the combination Tseng/Schlansker further discloses partitioning 
into digital signal processors ( re claim 2) and into bins ( re claim 3) (Tseng: EAB, DSP — col 
51, lines 27-34 ; cluster - col. 22, line 46-57; Schlansker: instruction groups - Fig. 16) 

As per claim 4, Tseng further discloses mapping includes mapping into multiple 
hardware contexts ( e.g. col. 22, line 12-45; Fig. 28 - Note: grouping by gate netlist/Register 
Translation, or chip cluster logic/common clock is equivalent to mapping for multiple hardware 
context). 
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As per claims 5 and 6, Tseng further discloses mapping ( re claim 5) a first set of 
invariants (e.g. FPGA component lib 361, logic function 359 - Fig. 6) and that said first set of 
variants are produced ( re claim 6) based on resource usage (e.g. RTL 358, Fig. 6; combinational 
component 303, Fig. 4 - Note: register allocation is equivalent to resource usage ). 

As per claim 7, Tseng further discloses mapping a second set of variants of said designs 
configured to support multiple hardware configurations of one of a plurality of bins (e.g. circuit 
design components, gate netlist - col. 22, lines 1-23). 

As per claim 8, Tseng further discloses mapping is performed by a place and route ( e.g. 
steps 352, 353, 354, 355- Fig. 6). 

As per claim 9, Tseng further discloses the decomposition step is performed manually 
{user -- e.g. col. 17, lines 13-17). 

As per claim 19, Tseng further discloses that mapping includes creating context 
dependent configurations (e.g. col. 22, line 12-45; components 901, 903, 904-Fig. 28 - Note: 
configuring by gate netlist/Register Translation language, or chip cluster logic/common clock 
context is equivalent to mapping for multiple context dependent configuration; code components 
are equivalent to code context configurations). 

As per claims 20 and 21, Tseng does not explicitly teach that the matrix used in the 
mapping is sparely ( re claim 20) or fully ( re claim 21) populated; but discloses the connectivity 
matrix and mapping of FPGA circuits to hardware models ( e.g. Fig. 7, 16; col. 25, line 49 to col. 
26, line 15), the component type used for defining partial or generalized connectivity in the array 
of processing elements (e.g. col. 18,lines 20-41). Combining Tseng's teaching with 
Schlansker/Kolchinsky's use of a matrix to map design logic to configuration resources as 
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mentioned in claim 1 , the limitation on such matrix being populated as claimed herein would 
have been obvious because of the same rationale mentioned in claim 1; and also because Tseng's 
component type analysis as mentioned above would imply sparsely or fully populating of the 
matrix as mentioned in claim 1 . 

As per claim 22, this claim is a system claim corresponding to claim 1 above and 
includes most of the limitations therein using Tseng's disclosure, namely: 

plurality of hardware accelerators partitioned (e.g. col. 1, lines 45-59; FPGA chip, col. 
10, lines 2-22; col. 12, lines 45-55 ); 

plurality of kernel sections (e.g. HDL language, code component -col. 13, lines 27-36; 
HDL code description, component— col. 1 1 , lines 47- 64 ) 

created from a program source code (e.g. col. 58, line 62 to col. 59, line 50; Fig. 4, 26, 
28; Verilog, VHDL, parsing process, compiler 210 - col. 15, line 62 to col. 16, 38 ) for execution 
on said plurality of hardware accelerators (e.g. Fig. 25-30); 

plurality of hardware dependent model representation language (e.g. hardware model, 
RTL, gate level, - col. 11, lines 47- 64; hardware execution models - col. 13, lines 36-40 ); 

(1) mapping of accelerators and kernel designs for run time execution (e.g. steps 306, 
307, 309 - Fig. 4 ; Fig. 6; col. 19, lines 14-21; col. 21, lines 36-43); hence is rejected using the 
corresponding rejection as applied in claim 1 above, using Tseng's teachings. 

However, like in claim 1 , Tseng does not explicitly teach that the model representation 
language for said kernel sections are executable code for execution on said plurality of hardware 
accelerators and that the mapping of hardware accelerators and such executable is configured to 
support runtime execution. But this limitation has been addressed in claim 1 using Schlansker. 
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Nor does Tseng disclose a matrix used for said mapping in (1), but such matrix limitation 
has been addressed by Tseng combined with Schlansker/Kolchinsky as mentioned in claim 1 
above; and is herein rejected using the same rationale set forth therein. 

As per claims 23-30, these claims are system claims corresponding to claims 2-9, 
respectively; hence, are rejected using the corresponding rejections set forth therein, respectively. 

As per claims 41-42, these claims are similar to claims 20-21 above, respectively; hence 
are rejected herein using the same grounds set forth therein. 

As per claim 43, this claim is a computer-readable medium version of claim 1, above 
hence includes all the step limitations therein and is rejected herein using the same corresponding 
rejections set forth therein. 

As per claim 44, this claim is a system version of claim 1, above hence includes all the 
step limitations therein and is rejected herein using the same corresponding rejections set forth 
therein 

4. Claims 10-17, and 31-38 are rejected under 35 U.S.C 103(a) as being unpatentable over 
Tseng et al.,USPN: 6,009,256, in view of Schlansker et al., USPN 6,408,428, and Kolchinsky et 
al., USPN: 5,535,406, as applied to claims 1 and 22 above, and further in view of Trimberger, 
USPN: 5,752,035 (hereinafter Trimberger). 

As per claim 10, Tseng does not specify that the decomposition step is performed by a 
software profiler; but discloses deriving of gate combinational elements based on type analysis 
(e.g. col. 18, lines 22-41; step 302, Fig. 4) for network array execution path differentiation; and 
profiling functions to optimize the cost for placement of processing elements in the array (e.g. 
col. 23, line 58 to col. 23, line 21; Fig. 6). Schlansker, in the method ( re claim 1) to match an 
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instruction set for a particular accelerators against data flow conditions and parametric 
requirements, discloses organizing instruction code into set and groups for evaluation against 
operational statistics (e.g. Fig. 16-20 - Note: collection of statistics and metrics for performance 
evaluation is equivalent to profiling) and Trimberger, in a method for compiling and executing 
programs using re-programmable logic accelerator sets analogous to the FPGA accelerators used 
in Tseng's method (combined with Schlansker/Kolchinsky) for simulation, discloses the use of 
profiling to detect the most frequently executed code and replace it with programmable 
instructions unit, or accelerator set (e.g. col. 6, lines 31-56). It would have been obvious for one 
of ordinary skill in the art at the time the invention was made to use a profiler as suggested by 
Schlansker and taught by Trimberger to detect optimizable portions of code for submission into 
accelerator logic units as taught by Trimberger and combine it with the analysis and optimization 
techniques above-mentioned by Tseng (combined with Schlansker/Kolchinsky) because this 
would provide additional performance improvement of the program execution/simulation 
performed in Tseng's invention (combined with Schlansker/Kolchinsky). 

As per claim 11, Tseng discloses that the decomposing step includes executing code 
from program source code and monitoring timing of said execution (e.g. col. 20, line 4 to col. 21 , 
line 24; Fig. 6; monitor( Stime...), test-bench component (monitor) 906- Fig. 26,28; Eval Timer 
1004, Fig. 23). 

As per claims 12 and 13, Tseng discloses utilizing set of test data (e.g col. 13, lines 41- 
47; evaluate test-bench components, steps 333, 337-Fig. 5; Figs. 29-30) in the execution of the 
simulation; but does not specify using test data ( re claim 12) in the decomposing step nor does 
Tseng specify that ( re claim 13) said monitoring includes determining functions that consume a 
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significant portion of execution timing. But the limitation of using a profile analysis to 
determine code portions occupying significant execution time prior to optimization has been 
addressed in claim 10 above in view of Schlansker and Trimberger's teachings. In view of 
Tseng's (combined with Schlansker/Kolchinsky's) teaching to partition software simulation and 
hardware simulation (Figs. 29-30) using iterative test-bench components testing as mentioned 
above, it would have been obvious for one of ordinary skill in the art at the time the invention 
was made to modify the decomposition step disclosed by Tseng (combined with 
Schlansker/Kolchinsky) and combine the profiling technique (so suggested by Trimberger) and 
the use of test data (as taught by Tseng) for detecting the code portion susceptible for further 
optimizing, or for accelerating in Tseng's (combined with Schlansker/Kolchinsky) method , 
because this would enhance further the incremental the code optimization or simulation cycle as 
taught by the combination Tseng ( combined with Schlansker/Kolchinsky's teaching). 

As per claim 14, Tseng discloses a decomposition step with identifying kernel sections 
by identifying regular structures (component type - col. 17, lines 24-34). 

As per claim 15, Tseng discloses identifying kernel sections by identifying sections with 
a limited number of inputs and outputs by way of basic blocks (e.g. col. 44, lines 55-65; Figs. 
17,18) of the hardware model, which are register component types fetched from the component 
type analysis, i.e. decomposition step (e.g. col. 18, lines 20-41 — Note: basic blocks are those 
whose inputs and outputs connection are in very limited number). 

As per claim 16, Tseng further discloses identifying kernel sections by identifying 
sections with a limited number of branches (e.g. col. 18, lines 20-41; col. 44, lines 55-65 - Note: 
a skill in the art would view basic blocks as those having limited number of branching) 
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As per claim 17, Tseng discloses decomposing by identifying sections that are overhead 
sections (e.g. software model 215, Fig. 3; col. 16, lines 28-32). 

As per claims 31-38, these claims are system claims corresponding to claims 10-18, 
respectively, hence, are rejected herein using the corresponding rejections set forth therein, 
respectively. 

5. Claims 18, 39, and 40 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Tseng et al., USPN: 6,009,256, in view of Schlansker et al., USPN 6,408,428, and Kolchinsky et 
al., USPN: 5,535,406, as applied to claims 1 and 22 above, and further in view of Mirsky et al., 
USPN: 6,457,1 16 (hereinafter Mirsky). 

As per claim 18, Tseng (combined with Schlansker/Kolchinsky) does not disclose that 
mapping includes creating microcode. Mirsky, in the method for configuring array of processing 
elements in response to state data submitted analogous to the simulation and testing of test-bench 
components applied to the circuit of processing elements disclosed by Tseng, discloses the use of 
microcode ( e.g. col. 10, lines 21-32). In view of the use resource-constrained processing 
elements such as chips and DSP as suggested by Tseng (col. 51, lines 27-34) for partitioning, it 
would have been obvious for one of ordinary skill in the art at the time the invention was made 
to implement microcode to the processing element such as taught by Mirsky to Tseng's 
(combined with Schlansker/Kolchinsky) mapping of hardware configuration to the design circuit 
for carrying out the simulation process as taught by Tseng. One of ordinary skill in the art would 
be motivated to do so because this would alleviate memory storage in small device used as PEs 
in the array of Tseng's system (combined with Schlansker/Kolchinsky), and further improve 
resource usage and time-efficiency. 



Application/Control Number: 09/65 1 ,425 Page 1 2 

Art Unit: 2124 

As per claim 39, this claim corresponds to claim 18 above, hence, is rejected herein 
using claim 18 rejection as set forth above. 

As per claim 40, Tseng discloses code including context dependent configurations ( e.g. 
register component 901, clock component 903, Test-bench component 904 - Fig. 28 - Note: all 
component-related code with configuration for variables are equivalent to context dependency 
configurations); but Tseng fails to disclose microcode in place of code. Such limitation has been 
addressed in claim 18 above and is rejected herein with the same ground of rejection set forth 
therein. 

Response to Arguments 

6. Applicant's arguments filed 1/12/2004 have been fully considered but they are mostly 
moot in view of the new grounds of rejection. Following are the reasons therefor. 

The rejection as submitted now addresses the newly added features being the partitioning 
of kernel sections in an executable code (i.e. the partitioning of VHDL or Verilog specification 
by Tseng into some form of executable to support the execution of Tseng's hardware 
accelerators), and the mapping of hardware and code sections. The rejection points out what part 
is deficient in Tseng's teachings that either Schlansker or Kolchinsky can remedy and as a whole 
how the combination as set forth in the rejection would fulfill the required features of the claims. 

Conclusion 

7. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Tuan A Vu whose telephone number is (703)305-7207. The 
examiner can normally be reached on 8AM-4:30PM/Mon-Fri. 
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If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Kakali Chaki can be reached on (703)305-9662. 
Any response to this action should be mailed to: 
Commissioner of Patents and Trademarks 



or: (703) 746-8734 ( for informal or draft communications, please consult Examiner 
before using this number) 

Hand-delivered responses should be brought to Crystal Park II, 2121 Crystal Drive, 
Arlington. VA. , 22202. 4 th Floor( Receptionist). 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). 



Washington, DC. 20231 



or faxed to: 



(703) 872-9306 ( for formal communications intended for entry) 
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February 27, 2004 




